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Abstract 

We consider the indexable dictionary problem, which consists of storing a set S C 
{0, . . . ,m — 1} for some integer m, while supporting the operations of rank(a;), which 
returns the number of elements in S that are less than x li x G S, and —1 otherwise; 
and select(i) which returns the i-th smallest element in S. We give a data structure 
that supports both operations in 0(1) time on the RAM model and requires B{n, m) + 
o(n) + 0(lg Ig m) bits to store a set of size n, where B{n, m) = [Ig (^)] is the minimum 
number of bits required to store any n-element subset from a universe of size m. 
Previous dictionaries taking this space only supported (yes/no) membership queries 
in 0(1) time. In the cell probe model we can remove the O(lglgm) additive term in 
the space bound, answering a question raised by Fich and Miltersen, and Pagh. 

We present extensions and applications of our indexable dictionary data structure, 
including: 

• an information-theoretically optimal representation of a fe-ary cardinal tree that 
supports standard operations in constant time, 

• a representation of a multiset of size n from {0, . . . , m — 1} in B{n, m + n) + o{n) 
bits that supports (appropriate generalizations of) rank and select operations in 
constant time, and 

• a representation of a sequence of n non-negative integers summing up to m in 
i3(n, m + n) + o{n) bits that supports prefix sum queries in constant time. 

1 Introduction 

Given a set S oi n distinct keys from the universe {0, . . . ,m — 1}, possibly the most fun- 
damental data structuring problem that can be defined for 5* is the dictionary problem: to 
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store S so that membership queries of the form "Is x in S7" can be answered quickly. In 
his influential paper [22], Yao considered the complexity of this problem and showed that 
the sorted array representation of S is the best possible for this problem, if one considers a 
suitably restricted class of representations. Since membership queries take fl{lgn) time to 
answer using a sorted arrajQ, a number of researchers have developed representations based 
on hashing that answer membership queries in constant time (see e.g. [29| [28| [T3l [6l [21] )• 

However, one extremely useful feature present in the sorted array representation of 5* 
is that, given an index i, the i-th smallest element in 5* can be retrieved in constant time. 
Also, when the presence of an element x has been established in a sorted array, we know the 
rank of x, i.e., the number of elements in S that are less than x. Schemes based on hashing 
work by "randomly scattering" keys, and do not intrinsically support such operations. It 
is natural to ask whether one can represent 5* in a way that combines the speed of hash 
tables with the additional functionality of sorted arrays. We therefore consider the problem 
of representing S to support the following operations in constant time: 

rank(x, S) Given x E {0, . . . ,m — 1}, return —1 if x ^ S and \{y G S\y < x}\ otherwise, and 

select(i, S) Given i G {1, . . . , n}, return the i-th smallest element in S. 

When there is no confusion, we will omit the set S from the description of these operations. 
We call this the indexable dictionary problem, and a representation for S where both these 
operations can be supported in constant time an indexable dictionary representation. 

Our interest lies in succinct representations of S, whose space usage is close to the 
information-theoretic lower bound. Motivated by applications to very large data sets, as 
well as by applications to low-resource systems such as handheld and embedded computers, 
smart cards etc., there has been a renewal of interest in succinct representations of data 
[3 El |20l [m im [221 |23l IMl [2l]. In the context of this paper, the information-theoretic lower 



one cannot represent an arbitrary set of n keys from {0, . . . , m — 1} in fewer than B{n, m) = 



As B{n,m) = nlg{em/n) — O(lgn) — Q{v?/m) [23], a sorted array representation of 
S*, which takes n [Igm] bits, can be significantly larger than the information-theoretic lower 
bound. Brodnik and Munro [6j were the first to give a succinct representation that supported 
constant-time membership queries. Pagh [21] improved the space bound to B{n, m) +o{n) + 
O(lglgm) bits, while continuing to support membership queries in constant time. Raman 
and Rao [26] considered dictionaries with rank, which support constant-time rank queries 
and gave a representation requiring n [Igm] -|- O(lglgm) bits of space; this is better than 
augmenting Pagh's data structure with n [Ign] bits of explicit rank information. Raman 
and Rao's data structure can also support select queries using n([lgm'| -|- [Ign]) -l-O(lglgm) 
bits, but this is nearly 2n\gn bits more than necessary. All the above papers [SI 1211 12S] 
assume the standard word RAM model with word size 6(lgm) bits [T1[T9]; unless specified 
otherwise this is our default model. 

^\gx denotes logj x. 
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Ig ("^j bits, and we seek representations that use space close to B{n,m). 
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1.1 Our results 



1.1.1 Indexable dictionaries 

We give an indexable dictionary representation that requires B{n,m) + o{n) + O(lglgm) 
bits to store a set of size n from {0, ... ,m — 1}. Modifying this data structure, we get an 
indexable dictionary representation that requires i3(n, m) + o{n) bits and supports operations 
in 0(1) time in the cell probe model [29] with word size 0(lgm). The significance of this 
modest improvement in space usage is as follows. Since B{n,m) + o{n) < n [Igm] for all n 
larger than a sufficiently large constant, this result shows that n words of [Igm] bits suffice 
to answer membership queries in constant time on a set of size n, and answers a question 
raised by Fich and Miltersen [12] and Pagh [21]. By contrast, Yao showed that if the n words 
must contain a permutation of S, then membership queries cannot be answered in constant 
time [29]. 



N. Note 
lich is close to n(lg A; + Ig e). 



1.1.2 Applications of indexable dictionaries 

Using the indexable dictionaries, we obtain the following results: 

• A k-ary cardinal tree is a rooted tree, each node of which has k positions labeled 
0, . . . , A; — 1, which can contain edges to children. The space lower bound for repre- 
senting a fc-ary cardinal tree with n nodes is C{n, k) 
that C{n,k) = {klgk - {k - l)\g{k - l))n - Oilg{kn)), wl 
as k grows. Benoit et al. [5J gave a cardinal tree data structure that takes ([Igfc] + 
2)n + o{n) + O(lglgfc) = C{n, k) + Q{n) bits and answers queries asking for parent, 
i-th child, child with label i, degree and subtree size in constant time. We obtain an 
encoding for fc-ary cardinal trees taking C{n,k) + o{n) + 0(lglg/c) bits, in which all 
the above operations, except the subtree size at a node, can be supported in constant 
time. Both the above results on cardinal trees use the word RAM model with a word 
size of 0(lg(/c + n)) bits. 

• Let M be a multiset of n numbers from {0, . . . , m — 1}. We consider the problem of 
representing M to support the following operations: 

rankm(x, M) Given x E U, return —1 if x ^ M and \{y G M\y < x}\ otherwise, and 

selectm(2,M) Given i G {l,...,n}, return the largest element x G M such that 
rankm(x) < i — 1. 

ran km and selectm are natural generalisations of rank and select to multisets. It is easy 
to see that B{n, m + n) is a lower bound on the number of bits needed to represent 
such a multiset, as there is a 1 — 1 mapping between such multisets and sets of n 
elements from {0, . . . , m + n — 1} [TT]. However, if we transform a multiset into a set 
by this mapping, then rankm and selectm do not appear to translate into rank and select 
operations on the transformed set. Using some additional ideas, we obtain a multiset 
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representation that takes B{n,m + n) + o{n) + 0(lglgm) bits, and supports ran km and 
selectm in constant time. This result assumes a word size of B(lg(m + n)) bits. 

Ehas [TT] previously considered the problem of representing multisets succinctly while 
supporting selectm and the following generalization of ran km: 

fullrankm(x) Given x G f/, return \{y G M\y < x}\. 

He considered the bit-probe model rather than the word RAM model, and was con- 
cerned with average-case behaviour over all possible operations. Our results on FIDs 
(see below) have consequences for this version of the problem. 

1.1.3 Fully indexable dictionaries and prefix sums 

We also give a subroutine that appears to be of independent interest. Given a sequence a of 
m bits, define the following operations, for 6 G {0, 1}: (a) rankb(z) - count the number of 6's 
before the position i in a, and (b) select;, (z) - find the position of the i-th b in a. It is shown 
in [71 [23] how to represent cr in m + o(m) bits and support these four queries in constant 
time. This data structure is a fundamental building block in a large number of succinct data 
structures [3 [2S1 [Ml HZl 1231 [21] • 

One can also view a as the characteristic vector of a subset S of n keys from U = 
{0, ...,m — 1}, and define a fully indexable dictionary (FID) representation of S to be 
one that supports the operations rank(x, 5"), select(2, 5), rank(x, S") and select(i, S") all in 
constant time, where S = U \ S is the complement of the set 5*. It is easy to see that an 
FID representation is functionally equivalent to a bit- vector supporting ranko/i and selecto/i. 
Extending a result due to Pagh pi], we give an FID representation for 5* that takes B{n, m) + 
0((mlglgm)/lgm) bits. This is always at most m + o(m) bits, but it may be substantially 
less: for example, whenever m/ y/lgm < n < m{l — 1/ylgrn), the space usage is at most 
B{n, m) + o{n) = (1 + o(l))S(n, m) bits. We give the following apphcation of this result: 

• We can store a multiset M of n values from {0, . . . , m — 1} to support selectm (but 
not rankm) in constant time using B{n, m + n) + o{n) bits. Another way of stating this 
result is that we can represent a sequence of n non-negative integers X = xi, . . . ,Xn, 
such that Xj = m, so that the query sum{i,X), which returns J^j^iXj, can be 
answered in constant time using B{n, m + n) + o(n) bits. 

The problem of representing integers compactly so that their prefix sums can be computed 
efficiently has been studied by a number of researchers including [HI [TTl HSl |Ml [2H]- Our 
solution is more space-efficient than all of these. The result of Grossi and Vitter fTli Lemma 
2], which is based on Elias's ideas, is the previously most space-efficient one and requires 
n(("lgm] — \ lgn\ +2) + o{n) bits to represent n non-negative integers adding up to m, where 
m > n. In most cases, this will be B(n) bits more than optimal. When n and m are not 
powers of 2, the ceilings and floors are a source of non-optimality; for example, take m = n 
with m not a power of 2; Grossi and Vitter's method requires 3n -|- o{n) bits in the worst 
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opposed to the lower bound of B{n,2n) = 2n — O(lgn) bits. Another source of 
non-optimahty is that the constant 2 is not optimal; for example, take m = cn where n and 
m are powers of 2 and c > 1. Grossi and Vitter's method requires (2 + Ig c)n + o{n) bits in 
the worst case, which can be easily shown to be at least (2 — (1 + c) lg((l + c)/c))n = Q{n) 
bits more than optimal (the difference tends to (2 — lge)?T, as c increases). On the other 
hand, our representation is always within o{n) bits of optimal. 

1.1.4 Lower bounds 

It is important to note that, appearances notwithstanding, some of the space bounds above 
may actually be much larger than the information-theoretic lower bound of B{n,m). For 
example, consider the space bound of B{n,m) + o(n) + O(lglgm) bits for storing a set 
S of size n from {0, ... ,m — 1} in an indexable dictionary representation. If n < m/2, 
B = B{n,m) > max{n, Igm} and this space bound is indeed B plus lower-order terms. 
However, as n gets very close to m, B can be much smaller than the o{n) term. If we only 
want to answer membership queries, we can assume n < 112/2 without loss of generality: if 5* 
has more than m/2 elements then we store its complement and invert the answers. However, 
in the indexable dictionary problem, it is not clear how answering rank and select queries on 
a set could help us to answer these queries on its complement in constant time. In fact, we 
note that if we could store a set S in B'^^^^ bits for all n and m, and support select (or rank) 
in constant time, then we could also support fullrank queries on S in constant time using 
B^^^^ bits. Here fullrank(x, S") returns the rank of x in S" for any x & U. It is known that 
in general, fullrank queries cannot be answered in constant time in the RAM model (or even 
in the cell probe model) while using n^^^'> words of (Igm)*^*^^^ bits each [H Corollary 3.10]. 
Thus, many of our space bounds are of necessity not information-theoretically optimal in 
some cases; one exception is the space bound for fc-ary trees, which is optimal for all A; > 2. 

1.2 Techniques used 

The main ingredient in our indexable dictionary representation is most- significant-bit first 
(MSB) bucketing. The idea is to apply a trivial top-level hash function to the keys in S, 
which simply takes the value of the t most significant bits of a key. As we can omit the t 
most significant bits of all keys that "hash" to the same bucket, space savings is possible. 
A similar idea was used by Brodnik and Munro [6] in their succinct representation of sets. 
A major difference between our approach and theirs is that they store explicit pointers to 
refer to the representation of buckets, which uses more space than necessary (and hence 
constrains the number of buckets). Instead, we use a succinct representation of the prefix 
sums of bucket sizes that not only provides the extra functionality needed for supporting 
rank and select, but also uses significantly less space. The related technique of quotienting 
[211 19] stores only the "quotients" of keys that are mapped to a bucket by a standard hash 
function (e.g. those of [13] )• The crucial difference is that MSB bucketing preserves enough 
information about the ordering of keys to allow us to maintain most of the rank information 
using negligible extra space. 
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Other ideas relevant to the indexable dictionary representation are range reduction ([T3] 
and others), distinguishing bits and others) and techniques for compactly represent- 
ing hash functions for several subsets of a common universe developed in 0. Our fc-ary 
tree representation does not encode the tree structure explicitly, a feature shared with the 
representation of [TO] . 

1.3 Organization of the paper 

The remainder of this paper is organized as follows. In Section 2, we give some building 
blocks that will later be used in our main results. Extending the dictionary with rank of 
[26] [5] , we first give a simple indexable dictionary that uses about 2n Ig n bits more than 
necessary. Then we show the connection between fully indexable dictionaries and prefix sum 
data structures and give some simple representations for both. These are then used in Section 
3, coupled with MSB bucketing, to obtain an improved result on indexable dictionaries, which 
reduces the space wastage to about 0{n) bits. 

In Section 4, we first develop a B{n,m) + 0((mlglgm)/ lgm)-bit fully indexable dictio- 
nary representation, extending a result of Pagh [21] . Using this and our result from Section 
3, we obtain our main result: an indexable dictionary taking B{n,m) + o{n) + O(lglgm) 
bits. In Section 5, we remove the O(lglgm) term in the space bound by moving to the cell 
probe model, giving a representation that takes B{n, m) + o{n) bits. Section 6 gives some 
applications of our succinct dictionaries to representations of multiple dictionaries, fc-ary 
trees, multisets and prefix sums. Section 7 makes some observations about the difficulty of 
achieving optimal space for all values of the input parameters. Section 8 recapitulates the 
main results and gives some open problems. 

2 Preliminaries 

In this section, we first establish connections between FIDs and prefix sums, and we end 
with simple representations of multiple indexable dictionaries and prefix sums. 

In what follows, if / is a function defined from a finite set X to a finite totally ordered 
set F, by ||/||, we mean max{/(x) : x G X}. We use the notation [m] to denote the set 
{0,l,...,m-l}. 

2.1 Fully Indexable Dictionaries and Searchable Prefix Sums 

Given a set S* C [/, recall that a fully indexable dictionary (FID) representation for S 
supports rank and select operations on both S and its complement S = U \ S m. 0(1) time. 
FIDs are essential to our data structure as they are intimately related to operations on prefix 
sums, as we note below. 

Given a sequence X oi n non-negative integers xi,...,Xn such that YIi=iXi = m, the 
searchable prefix sum problem is to find a representation of this sequence that supports the 
following operations in constant time: 
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sum(i, X) Given i E {1, . . . , n}, return Yl]=i 

pred(a;, X) Given x G [m], return max{i < n\ I]j=i < x}- 

We call a data structure that stores the sequence X to support the queries in constant 
time an (n, m)-searchable prefix sum data structure. We now make the connection between 
FIDs and the searchable prefix sums problem 

Lemma 2.1 Suppose that there is an FID representation for a given set S ^ U that uses 
f{\S\, \U\) bits. Then given a sequence X of non-negative integers xi,X2, ■ ■ ■ such that 
J27=i Xi = m there is an [n, m) -searchable prefix sum data structure for X using f{n, m-\-n) 
bits. 

Proof: Consider the following m + n bit representation of the sequence X. For i = 1 to 
n, represent Xi by Xj Os followed by a 1. Clearly this representation takes m + n bits since 
it has m Os and n Is. View this bit sequence as the characteristic vector of a set 5* of n 
elements from the universe [m + n]. Represent 5* as an FID using f{n,m-\-n) bits. It is easy 
to verify that pred(a;, X) = select(a;, S) — x + 1 and sum(z, X) = select(i, S) — i + 1. (Recall 
that [m + n] begins with 0.) □ 

Lemma 2.2 Given a set S C [m], there is an FID for S taking m + o(m) bits. 

Proof: Consider the characteristic vector of S, which is a bit-vector of length m. It is 
shown in [201 13 EB [23] how to represent this bit- vector using m + o(m) bits, to support the 
queries rankfe(z) and selectfc(i) in constant time, for b G {0, 1}. It is easy to verify that these 
operations on the characteristic vector suffice to support FID operations on S: for example, 
ranki(j) is given by rank(j, S) if the j-th bit is a 1, and by j — rank(j, S") — 1 otherwise. □ 
The following lemma is an immediate consequence of Lemma 12.11 and Lemma 12.21 

Lemma 2.3 A sequence S of n non-negative numbers whose total sum is m can be repre- 
sented using m + + o{m + n) bits to support sum and pred operations in constant time. 

2.2 A Simple Indexable Dictionary 

We now give a simple indexable dictionary representation for a set S* C [m*], based on 
perfect hashing schemes for membership [T3l [271 [21] • These perfect hashing schemes begin 
with finding a universe reduction function / : [m*] — > [{Sl"^] such that / is 1 — 1 on S". A 
problem with such an approach is that / requires Q{\g\gm*) bits to represent, which can be 
a significant overhead for sets which are very small but nonetheless not constant-sized. This 
overhead becomes significant if we need to store several sets in the data structure, as we pay 
the overhead repeatedly for each set. To reduce this overhead we use the idea of [5], which 
is to note that we do not need / to bring the universe size as far down as \S\'^ for small sets, 
thereby allowing the same / to be used for several (small) sets. 

Thus, it makes sense to talk about representing the set S, but excluding the space cost of 
representing a universe-reduction function. In the following lemma, which is a simplification 
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and extension of a scheme from 0, we use this approach. Here hs is the universe- reduction 
function and qs is a "quotient" function, which gives the information thrown away during 
the universe reduction and is used to recover x given hs{x). 

Lemma 2.4 Let m*,n* > 1 be two given integers, and let S C [m*] be a set of size at most 
n* . Suppose that we have access to two functions hs and qs, defined on [m*], satisfying the 
following conditions: 

1. hs is 1-1 on S . 

2. hs and qs can be evaluated in 0(1) time, and from hs{x) and qs{x) one can uniquely 
reconstruct x in 0(1) time. 

3. \\hs\\ = 0{{n*Y) if \S\ > ^/\gn* and \\hs\\ = 0{(\gn*y) for some constant c > 
otherwise. 

I rig||/i5in + rigiig5in =igm* + o(i). 

Then we can represent S using I^Klgm* + Igj^l + 0(1)) bits and support rank and select 
in 0(1) time. This assumes a word size of at least lgmax{m*,n*} bits, and access to a 
pre-computed table of o{n*) bits and a constant of 0{lgn*) bits that depends only on \ \hs\\, 
and that m* and n* are known to the data structure. 

Proof: Let / = 15*1 and suppose that S contains the elements xi < X2 < ■ ■ ■ < xi. 

If / < -y/Ign* then we write down /^^(xi), . . . , hs{xi) in fields of 6 = [Ig bits each, 

followed by g5(a;i), 55(0;/) in fields of [lg||gs'||] bits each. This requires I^Klgm* + 0(1)) 
bits. To compute rank(a;) we calculate hs{x) and look for a match in hs{xi), . . . , hs{xi). This 
can be done in 0(1) time using standard techniques [251 113 13], provided we have available 
the integer constant k that contains Is in bit positions 0, 6, 26, . . . , 6 ■ [{lgn*)/b\, as well as 
tables that enable us to compute, for every integer x of Ign* or fewer bits, the index of the 
most significant bit that is set to 1 (or, equivalently to compute [IgxJ). If we are unable 
to find an index i such that hs{x) = hs{xi), we return —1, otherwise we verify whether 
qs{x) = qs{xi). If so, return i — I, otherwise return —1. To compute select(i), reconstruct 
Xi from the values hs{xi) and qs{xi) and return it. 

If I > -y/lg n*, then let S' = {hs{x)\x G S}. We create a minimal perfect hash function 
/ • [ll^sll] ~^ m that is 1 — 1 on 5". As shown in [271 H]) there exists such a function 
/ that can be evaluated in 0(1) time and that can be represented in 0(/ + lglg||/i5||) = 
0(/ + Iglgn*) = 0{l) bits. We also store two tables of size /. In the first table R, for 
1 < 2 < /, we store the value i in the location f{hs{xi)) using a total of / [Ig/] bits. In 
the second table X, we store the elements of S in sorted order. Now to answer rank(x), 
we calculate j = R[f{hs{x))] and check if x = xj: if so, then rank(a;) is j — 1, and is — 1 
otherwise. Supporting select is trivial since we have stored the XjS in sorted order in X. □ 

The following lemma from [5] gives the space savings obtained by combining universe 
reduction functions for different sets: 
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Figure 1: Two level MSB bucketing 



Lemma 2.5 ([5]) Let n*,m* be as in Lemma \2.4\ and let < ii < i2 < ■ ■ ■ < is < n* be a 
sequence of integers. Let Si-^, Si^, . . . , Si^ be subsets of [m*] such that J2j=i \Sij\ < n* . Then 



there exist functions hs^. and qs^. for j = 1, . . . , s that satisfy the conditions of Lemma 2.4 



which can be represented in o{n*) +0(lglgm*) bits in such a way that given ij we can access 
hs-. and qs^. in constant time. 
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3 Saving n\gn bits using MSB bucketing 

In this section, we first give a representation that takes about n[lgm] bits to represent a 
set of size n from a universe of size m, and supports rank and select operations in 0(1) 
time (Theorem 13 .11) . We then use this representation to store multiple independent (but not 
necessarily disjoint) dictionaries efficiently (Lemma 13.11) . 

Theorem 3.1 There is an indexable dictionary for a set S C [m], IS"! = n, that uses at 
most n [Igm] + o{n) + O(lglgm) bits of space. 

Proof: Our construction algorithm partitions S using MSB bucketing, recursing on large 
partitions. The base case of the recursion is handled using Lemma 12.41 We get an overall 
space bound of n [Ig rri\ assuming the hypothesis of Lemma 12.41 for each application of this 
lemma. We then show how to support rank and select in 0(1) time. Finally, we sketch 
how to use Lemma 12.5! to represent all functions used in applications of Lemma 12.41 using 
o{n) + O(lglgm) bits. 

Let t = \lg m \ — \\g n] , and let c and d be two constants whose values are to be determined 
later. If n < d, then we store the elements of S explicitly, using n [Igm] bits, and we are 
done. 

Otherwise, if n > d, we partition the elements of 5* according to their top {Ign] bits. 
This partitions S into y = 2^^^"-^ < 2n sets denoted by 5*0, ... , Sy^i, where Si consists of the 
last t bits of all keys in S whose most significant [Ig n] bits have value i, for i E [y]. We store 
a representation of the sizes of these y sets which takes n + y + o{n + y) < 3n + o{n) < An bits 
(for sufficiently large n) using Lemma 12.3! and pad this out to 4n bits. The representation 
of S is obtained by concatenating these 4n bits with the representations of each of the S'i's, 
for i G [y]. 
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The representation of Si, for i E [y], is obtained as follows. Let rii = \Si\. If rii < d, we 
write down the elements of Si using riit bits, and pad this out to riiit + i + c) bits. Otherwise, 
we again partition the elements of Si into z = 2^^^^^'^ sets according to their top [Ig nj] bits, 
denoted as T°, . . . ,T/~^ (see Fig. [1]). We store a representation of the sizes of these z sets 
and pad this out to iui bits. Again, the representation of Si is the concatenation of these 
Arii bits with the representations of each of the T/s, for j G [z]. 

The representation of T/, for j G [z], is obtained as follows. If |T/| < d, then we write 
down its elements using \T-\{t— [Igrij]) bits, and pad this out to \T-\{t + c) bits. Otherwise, 
we store it using the representation of Lemma [2^ (with m* = 2*"^'^"*^ and n* = n), padding 
this out to \Ti\(t + c) bits if necessary. (Note that the representation of T/ using Lemma [2^ 
takes \Ti\(t — [Igrij] + Ig |T/| + 0(1)) bits. Thus it is enough to choose c to be equal to the 
constant in the 0(1) term to guarantee that this is at most |T/|(t + c) bits.) 

When Si is partitioned, its representation takes 4nj + J2jZo \Ti \ {t + c) = Aui + ni{t + c) = 
nj(t + 4 + c) bits. Thus the representation of Si takes ni{t + 4 + c) in either case. Hence 
the length of the representation of S, when it is partitioned, is An + Yl^^Zo ni{t + 4 + c) = 
4n + n{t + 4 + c) = n{t + 8 + c) bits. Thus, in either case, S takes n{t + 8 + c) bits. This is 
at most n [Igm] bits, for sufficiently large d (since t = [Igm] — [Igra]). 

We now describe how the computation of rank proceeds; select works in a similar way. If 
n < d, we apply the trivial algorithm and return. Otherwise, we consider the first An bits of 
the representation of 5", which contains the representation of the sequence a of the sizes of 
the buckets Si, i G [y]. We extract the the top \\gn~\ bits of the current kej{§; suppose that 
these bits have value i. Using Lemma [2.31 we calculate p = sum(i — 1, cr) and p' = sum(z, a) 
in 0(1) time; note that p' — pis the size of the set Si to which the current key belongs. The 
start of the representation of Si is also easy to compute: it starts An + p{t + 4 + c) bits from 
the start of the representation of S. We then remove the top [Ig |T|] bits from the query key, 
add the rank of the resulting key in the set Si to p and return. Thus the problem reduces to 
finding the rank of a key in some set Si. 

If 1 1 < d, then we apply the trivial algorithm to find the rank of a key in Si. Otherwise, 
we apply a similar algorithm as above to reduce the problem to finding the rank of a key 
in some set T/. Again, if |T/| < d, then we apply the trivial algorithm to find the rank. 
Otherwise, since T/ is stored using the representation of Lemma [27il we can support rank in 
constant time. The overall computation is clearly constant-time. 

It is easily verified that n* = n is an appropriate choice for all applications of Lemma 12.41 
above. We now verify that the additional space required (in terms of the pre-computed table 
and constants) is not excessive. Firstly, the pre-computed table is of size o{n*) = o{n) bits 
and is common to all applications of Lemma [2.4[ At most 0(lg?T,) constants are required, 
one for each possible value of b = llgWhsW], which require negligible space. 

We now discuss the use of Lemma 12.51 to represent the functions for all the base-case 
sets. The lemma requires that there is a numbering of the sets using integers from [n], but 
we can simply take the number of a set to be the sums of the cardinalities of the sets whose 
indices are less than its own index. This information must be computed anyway during rank 

^Standard techniques allow us to calculate \\gx] in constant time [15]. 
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and select. Finally, the space required for representing the functions is o{n) + 0(lglgm) bits. 
This completes the proof of Theorem 13.11 □ The following lemma is an easy extension of 
Theorem 13.11 

Lemma 3.1 Let 81,82, ■■■,Ss all contained in [m] be given sets with 8i containing Ui 
elements, such that Yl,l=i = Then this collection of sets can be represented using 
n [Igm] +o{n) +0(lglgm) bits where the operations rank(x, 8i) and select(j, 8i) can be sup- 
ported in constant time for any x e [m], 1 < j < n and 1 < i < s. This requires that we 
have access to a constant-time oracle which returns the prefix sums of the rii values. 

Proof: If we apply Theorem 13.11 directly to each set 8i we get a representation taking 
Ylii=i i^i Rg"^l + o{ni) + 0(lglgm)) = n [Igm] + o{n) + O(slglgm) bits, that supports 
rank and select on each set in 0(1) time. The beginning of the representation of each set can 
be calculated using the oracle supporting the prefix sum queries in constant time. To get the 
claimed space bound, we apply Theorem 13. II to represent each 8i, but with the modification 
that Lemma [275] is used only once across all applications of Theorem 13. II The only change this 
causes is that we need a global numbering (using indices bounded by n) of all base-case sets 
created when applying Theorem 13. II to the S'j's. Recall that when applying Theorem 13. II to a 
particular set 8i, we give each base-case set that is created a 'local' number bounded by n,. 
Thus, an appropriate global number for a base-level set created when applying Theorem 13.11 
to 8i is just its local number plus Z]}=i This gives the claimed bound. □ 

4 Obtaining a sublinear lower-order term 

In this section, we develop the main result of the paper, namely, a representation for an 
indexable dictionary taking B{n, m) +o(ra) + 0(lglgm) bits of space. We begin by observing 
that the bound of Theorem 13. II is better than claimed: it is actually B{n, m) + 0(n + lglgm) 
bits. The constant factor in the 0{n) term can be improved by means of one more level of 
MSB bucketing, as follows. We place the keys into 2l-'s"J buckets based upon the first [IgnJ 
bits of each element. We represent the sizes of these buckets using at most 2n-\-o{n) bits via 
Lemma 12.31 This partitions the given set into multiple (up to n) sets which contain keys of 
[Ig m] — [Ig n\ bits each; the collection of sets is then represented using the data structure of 
Lemma \37i\ The resulting dictionary takes at most n([lgm] — [IgnJ +2) + o(n) + 0(lglgm) 
bits and supports rank and select in constant time. 

Recalling the discussion on representing prefix sums in the introduction, this bound is 
also non-optimal by G(n) bits in many cases. In addition to redundancy caused when m 
and n are not powers of 2, the constant 2 is not optimal. For example, when m = en for 
some constant c > 2, the disparity in this case is (2 — clg(c/(c — l)))n bits, which tends 
again to about (2 — lge)n bits for large c. To bring the linear term of the space bound closer 
to optimal, we place the keys into Q{ny/\gn) buckets; this will also enable us to 'remove' 
the ceilings and floors in the bound. However, using a super-linear number of buckets uses 
too much space if we use Lemma 12.31 to represent their sizes. Hence, we now develop a 
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much more space-efficient alternative to Lemma 12. 3[ by giving more space-efficient FIDs. In 
particular, we show the following lemma which is an extension of [231 Proposition 4.3]. 

4.1 Fully Indexable Dictionaries for Dense Sets 

Lemma 4.1 Given a set S C [m], l^l = n, there is an FID on S that requires B{n,m) + 
0(mlglgm/lgm) bits of space. 



Proof: Take u = 
each, with the i-t 



ilgmj and divide the universe [m] into p = \m/u] blocks of u numbers 
1 block Ui = {{i — l)u, . . . , m — 1}, for 1 < i < p — I, and Up = {(p — 
1)m, . . . , m— 1}. Let Si = SnUi and Hi = \Si\. Clearly, we can view as a subset of [m] , which 
we now do for convenience. The set Si is represented implicitly by a string of B{ni, u) bits by 
storing an index into a table containing the characteristic bit vectors of all possible subsets 
of size Hi from a universe of size u. S is represented by concatenating the representations 
of the S'j's; the length of this representation of S is at most B{n,m) + 0{m/ \gm) bits, as 
shown in [6]. 

To enable fast access to the representations of the S'jS, we store two arrays of size p. 
The ffist array A stores the numbers rii in equal-sized fields of \\gu\ bits each. The second 
array B stores the quantities B{ni,u); since B{ni,u) < u these numbers can also be stored 
in equal sized fields of \\gu\ bits each. This requires 0(m Iglgm/lgm) bits of space. We 
also store the prefix sums of the two arrays, as described in [24i Proposition 4.2] or [28], in 
0(m Ig Ig m/ Igm) bits, such that the i-th prefix sum is calculated in 0(1) time. We also 
store precomputed tables to support rank and select queries on an arbitrary set Si given its 
size and its implicit representation. These tables require 0{m}~''') bits of space for some 
fixed positive constant e < 1. 

To find rank(x) we proceed as in [23]: ffist compute i = [x/mJ, find the number of 
elements in 5*0 U ... U Si-i using the partial sum data structure for the array A, index into 
the string for S to get the representation of Si using the partial sum data structure for the 
array B, and find the rank of x within the set Si using a table lookup. 

To support select we do the following. We let v = [(Igp)^J and q = [n/v\. We store an 
array C of size q + 2 such that C[0] = 0, C[q + 1] = p, and for j = 1, . . . , q, C[j] stores the 
index I < p such that ^'=1^4 < jv < S'^^^nj. The array C takes 0{n/\gp) = 0{n/\gm) 
bits and allows select(ji;) for j = 1, . . . , g to be answered in 0(1) time, as follows. Letting 
k = C[j], we use the partial sums of B to extract the representation of Sk, use the partial 
sums of A to calculate s = J2i=i rii, and use table lookup to return the (jv — s)-th element 
from Sk as the final answer. 

To support select for arbitrary positions, we follow the ideas of [3 [23]. For i = 1, . . . , q+1, 
we define the i-th segment as ^^=c[i^i]+iUj] i.e., the part of the universe that lies between 
two successive indices from C. As v > u for sufficiently large m, C[i] > C[i — 1] for all 
1 < z < g, and all segments (except perhaps the last) are nonempty. We call a segment 
dense if its size is at most (Igp)^ and sparse otherwise. 

For each sparse segment, we explicitly list (in sorted order) the elements of 5* that lie in 
that segment. The space required to represent the elements of S that lie in a sparse segment 
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is therefore 0((lgp)^ ■ Igm), but since there are at most m/(lgp)^ sparse segments, this 
adds up to 0{m/ Igm) bits overall. For a dense segment, we construct a complete tree with 
branching factor ^/\gp , whose leaves are the blocks that constitute this segment. Since the 
number of leaves is 0((lgp)^), the depth of this tree is constant. At each node of this tree, we 
store an array containing the number of elements of S in each of its child subtrees. If the tree 
for a dense segment has k leaves, the space usage for this tree is 0{k Ig Igp) bits. As segments 
are disjoint and the total number of blocks is 0(m/lgm), this adds up to 0(mlglgm/lgm) 
bits overall. We store explicit pointers to the beginning of the representation of each segment, 
which takes 0(m/lgm) bits as there are only 0(m/(lgm)^) segments. 

The representations of all sparse segments are stored consecutively, as are the representa- 
tions of all dense segments. A bit-sequence of length g + 1, where the i-th bit of the sequence 
is 1 if the z-th segment is sparse and otherwise, is used to distinguish between the two 
cases; this bit sequence is stored as a FID using Lemma [2.21 Using rank operations on this 
FID, we can access the representation of the z-th segment, be it sparse or dense. 

To compute select(z) we first identify the segment in which the z-th element can be found. 
Letting ki = C[[i/v\], by inspecting the prefix sums of A at positions ki and ki + 1 one 
can determine whether the i-th element belongs to the segment ending at ki or the one 
beginning at ki + 1. Suppose it belongs to the segment a. Using the prefix sums of A, we 
determine the rank of the element to be selected in a. If a is sparse we read the required 
element directly from a sorted array. Otherwise, if a is dense, we start at the root of the 
tree corresponding to a and do a predecessor search among the numbers stored in the array 
stored at that node to find the subtree to which the required element belongs. This can 
be done in constant time via table lookup using tables of negligible size, as the array at 
each node takes 0{y/lgplg\gp) = o(lgm) bits. Thus, in constant time we reach a leaf that 
corresponds to some block Sj which is known to contain the element sought. We find the 
number of elements s in 5*0 U . . . U Sj-i using the partial sum data structure for the array A, 
index into the string for S to get the representation of Sj using the prefix sum data structure 
for the array B, and find the position / of the {i — s)-th element in the representation of Si 
using a table lookup. 

Now we consider supporting rank and select operations on S. Again letting Si = SdUi and 
Hi = \Si\, we observe that fii = u — Ui, and so the prefix sums of A suffice to answer prefix sum 
queries on the UiS. Likewise, the implicit representation of Si is also an implicit representation 
of Si and the concatenated representations of the SiS is also an implicit representation of 5* 
that takes only B{n, m) + 0{m/ Igm) bits, from which the representation of a single Si can 
be retrieved in 0(1) time using the array B. Thus, answering rank queries on 5* requires no 
additional information except new tables (of negligible size) for performing rank and select 
on the implicit representations of the SiS. 

To answer select queries on S, we create an array C which is analogous to the array 
C, and which partitions the universe anew into segments. Selecting elements from 5^ in 
these segments is done as before, with trees for dense segments and sorted arrays for sparse 
segments. This requires 0(mlg Igm/ Igm) additional auxiliary space. □ 
Remark: By replacing the implicit representations of the S'j's with the characteristic vector 
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of set 5*, we get a representation of a bit- vector of length m that takes m + 0(mlglgm/ Igm) 
bits and supports rank^ and selectfe queries, for b G {0, 1} (defined in Section [TTT]) . in constant 
time. This improves the lower-order term in space of the earlier known data structures [71 
from O (m/ Ig Ig m) to O (m Ig Ig m/ Ig m) . □ 
As an immediate consequence of Lemma [4.11 we get: 

Corollary 4.1 There is a fully indexable dictionary representation for a set S C [m], \S\ = n 
that uses B{n,m) + o{n) bits of space, provided that m is 0{n^J\gn). 

The following corollary is a consequence of Corollary 14.11 and Lemma 12.11 Note that 
B{n, m + n) is the information theoretic minimum number of bits to represent a multiset of 
n elements from [m]. 

Corollary 4.2 If m = 0{ny/lgn) , then a sequence S of n non-negative numbers that sum 
up to m can be represented using B{n,m, + n) +o{n) bits to support sum and pred operations 
in constant time. 



4.2 Optimal Bucketing for Sparse Sets 

In this section we prove our main result. A key idea will be to use MSB bucketing to place 
keys into uj{n) buckets, and the following proposition will be used to bound the increase in 
space usage: 

Proposition 4.1 For all integers x,y,c> 0, y > x, B{x, y + c) — B{x, y) = 0{cx/y + Igx + 
x^/y). 

Proof: We begin with the estimate B{x,y) = x\g{ey/x) — O(lgx) — Q{x^/y) [211 Equation 
1.1]. From this it follows that B{x,y + c) — B{x,y) = 0{x\g{{y + c)/y) + \gx + x^/y) 
= 0{cx/y + \gx + x'^/y). □ 
Now we use Corollary 14. 21 to prove our main result: 

Theorem 4.1 There is an indexable dictionary for a set S ^ [m] of size n that uses at most 
B{n,m) + o{n) + 0{\g\gm) bits. 

Proof: First, if m < Auy/lgn then we use Corollary 14.11 which establishes the result. If 
m > 4:n^/\g n, we choose an integer / > such that n^/\g n < [m/2'J < 2n\/\gn. We now 
group the keys based upon the mapping g{x) = [a;/2'J . Let r = [(m — 1)/2'J . We "partition" 
S into sets Bi, for z = 0, . . . , r, where Bi = {x mod 2^ \ x E S and g{x) = i}. Let bi = \Bi\, 
for i = 0, . . . , r. We represent the sequence Btop = {bo, . . . ,br) using the data structure of 
Corollary 14.21 taking B{n,r -\- n -\- 1) + o{n) bits, which supports sum and pred on Btop in 
constant time. By Proposition 14.11 B{n,r + n + 1) — B{n,r) = 0{n^/r + \gn) = o{n) as 
r = Q{n^/\gn). Thus, the space usage is B{n,r) + o(n) bits. 

The overall representation is the following. First we represent Btop as above. Then we 
represent each of the i?i's using the data structure of Lemma 13. 1[ The total space used 
will be nl + B{n,r) + o(n) + O(lglgm) bits. Note that B{n,r) = n\g{er/n) + o(n) as 
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r = Q{ny/lgn), and so nl + B{n, r) = nl + n\g{me/ (2'n)) + o{n) = B{n, m) + o{n). Thus, the 
overall space bound is as claimed. The computations of rank and select proceed essentially as 
in Theorem \'S.1\ except that we use Corollary 14.21 instead of Lemma 12. 3[ to represent Btop- 
□ 



5 An indexable dictionary in the cell probe model 

In this section we give an indexable dictionary representation for a set S of size n from a 
universe of size m that uses B{n, m) + o{n) bits of space in the cell probe model [29]. Recall 
that in this model, time is measured as just the number of words (cells) accessed during an 
operation. All other computations are free. We first prove a lemma which is analogous to 
Lemma [2.41 but which does not assume access to the functions hs and qs- 

Lemma 5.1 There is an indexable dictionary for a set S C [m] of size n that uses n(lgm + 
Ign + 0(1)) hits in the cell probe model. 

Proof: Let Xi < 0:2 < . . . < a;„ be the elements of S. 

li n > y/lgm, then we first store the given set S in an array A in increasing order, which 
takes n Igm + 0{n) bits of space. As in Lemma [2.41 we find a minimal perfect hash function 
/ for S and store it using 0{n + Iglgm) = 0{n) bits (since n > ^/\gm). We then store a 
table T with T[f{xi)] = i. This requires nlgn + 0{n) bits. To answer rank(x), we calculate 
i = T[f{x)] and check if x = Xj] if so we return j — I, otherwise return —1. Supporting 
select is straightforward, as we store the elements in sorted order in A. 

Otherwise, if n < \/lgm, let s = and r = [(lgm)/s], and note that r = O(n^). 

We divide the [Igm] -bit representation of each x E S into r contiguous pieces, where each 
piece has size exactly s bits, except for one piece (consisting, say, of the most significant 
bits of x) which has size s' bits, 1 < s' < s. We number the parts 0, . . . , r — 1 with being 
the most significant. Since n < y/lgm, s > 1, and this is possible. Then there exists a set 
i? C [r], \R\ = n, such that if we consider only the bits in the parts that belong to R, all keys 
in S are still distinct [2]. Let h{x,R) be the number obtained by extracting the bits in x's 
representation from parts that belong to R, and concatenating them from most significant 
to least significant. Then for any distinct x,y & S, h{x,R) 7^ h{y,R). Similarly let q{x,R) 
be the number obtained by extracting the bits in x's representation from parts that do not 
belong to R, and concatenating them from most significant to least significant E] The set 5* 
is represented as follows. 

First, we store an implicit representation of R; this takes Ig = nlgn + 0{n) bits. 
Then, we store the sequences h{xi, R), h{x2, R), . . . , h{xn, R) and q{xi, R), q{x2, R), . . . , q{xn, R) 
in that order. Clearly, this representation takes n Igm + nlgn + 0{n) bits. 

To answer rank(x), we read R first; as nlgn = o(lgm) this can be done in 0(1) time. 
Then we compute h{x, R) in 0(1) time. We then read h{xi, R), . . . , h{xn, R)', since h{xi, R) 
is 0((lgm)/n) bits long, all these values can be read in 0(1) time. We then find an i such 



^It appears to be difficult to compute h{x, R) and q(x, R) in 0(f ) time on the RAM model. 
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that h{xi,R) = h{x,R); if such an i exists, we verify the match by reading q{xi,R) and 
comparing it with q{x, R), and return i — 1 or —1 as appropriate. If such an i does not exist 
then X ^ S. It is easy to see that select can also be supported in constant time using this 
representation. □ 
Using the representation of Lemma [5. II instead of Lemma [2.41 for representing the sets at 
the bottom level in the proof of Theorem 13.11 we get an indexable dictionary data structure 
that takes n [Igm] + o{n) bits. One can use this data structure to get a result similar to 
Lemma IXn but without the additive O(lglgm) term in the space complexity. Thus we have: 



Lemma 5.2 Let Si, S2, ■ ■ ■ , Sg all contained in [m] be given sets with Si containing ui 
elements, such that X]i=i = Then this collection of sets can be represented using 
n [Igm] +o(n) bits where the operations rank(x, Si) and select(j, Si) can be supported in con- 
stant time in the cell probe model, for any x G [m], I < j < n and 1 < i < s. This requires 
that we have access to a constant-time oracle which returns the prefix sums of the rii values. 

Using Lemma 15.21 in place of Lemma 13.11 in Theorem 14.11 we get the following result for 
the cell probe model: 

Theorem 5.1 There is an indexable dictionary for a set S C [m] of size n using B{n, m) + 
o(n) bits in the cell probe model. 

As an immediate corollary we get: 

Corollary 5.1 There is an indexable dictionary for a set S C [m] of size n using at most 
n \\g ni\ bits in the cell probe model. 

6 Extensions and applications 

In this section, we give some extensions and applications of our succinct indexable dictionary 
(Theorem 14.11) as well as our fully indexable dictionary for dense sets (Corollary 14. ip . 

6.1 Multiple Indexable Dictionaries 

Here, using our succinct indexable dictionary, we will give a better representation for multiple 
indexable dictionaries, improving on Lemma 13.11 

Let 5*0, 5*1, ... , Sg-i all contained in [m] be a given sequence of dictionaries with Si con- 
taining Ui elements, such that YIiIZq = t^- Note that the representation in Lemma 13.11 of 
these multiple dictionaries requires an oracle to specify the starting point of each dictionary 
in the sequence. The representation we develop here does not make use of this assumption, 
but instead requires that s = 0{n). Define the set S as follows: 

•S* = j) : i e [s\,i G [m] and j G Si). 
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We map the pairs G [s],j G [m] to integers in the range [nis] using the obvious 

mapping ^ i ■ m + j . We represent the n-element set S using our indexable dictionary 
representation of Theorem 14.11 which takes B{n,ms) + o{n) + O(lglgms) bits. As any n- 
element subset of [ms] corresponds to a unique sequence of s sets (using the inverse of the 
above mapping), the first term B{n, ms) is the minimum number of bits required to represent 
such a sequence of multiple dictionaries. 

Now to support the multiple dictionary operations rank(a;,S'j) and se\ect{j, Si), we need 
to find the rank of {i, 0) in S even if {i, 0) ^ S. We can do this by a more detailed inspection 
of the proof of Theorem 14. H and potentially modifying S slightly. 

If ms < 4n^lgn, then the set S is dense and so this follows from Lemma 14.11 If 
ms > 4n^/\gn then we alter m to a new and carefully-chosen value m', and redefine S 
with the new value of m'; more precisely the pairs in S stay the same, but we change the 
mapping that takes pairs to integers as ^ i - m' + j . By doing this, we ensure that no 
bucket at the top level of Theorem 14.11 contains elements of the form {x,y) and {x',y') for 
X x' (i.e., all elements in a bucket have the same first co-ordinate). Thus, answering rank 
queries for {x, 0) only requires summing up the sizes of a number of top-level buckets, which 
is supported by the top level representation. 

We now discuss the choice of m'. Recall that if we apply Theorem 4.1 directly to S, we 
would choose an integer / such that n^/[gn < [ms/2''\ < 2n^/[gn and place x in the bucket 
. Let / be this integer, and let m' = 2' ■ \m/2^ ] , i.e., round the value of m to the next 
higher multiple of 2K Now it is easy to verify that \_{x ■ m' + y) /2'J ^ \_{x' ■ m' + y')j2}\ for 
X 7^ x', and thus keys belonging to distinct dictionaries are mapped to different buckets. 

However, this increases the universe size to m's from ms. Due to this increase, a direct 
application of Theorem 14.11 may result in the elements being bucketed according to the 
mapping x ^ [x/2' J, for some I' > I. This issue is most easily dealt with by noting that 
a.s m' < m + 2\ m's < ms{l + 2^/m) = ms{l + Q{s/{n^/lgn))) = ms{l + 0{l/^/lgn)) 
(recall that s = 0{n) by assumption). This in particular means that, for n larger than some 
constant, m's < 2ms, and so retaining the mapping x h-h> [a;/2^J in the proof of Theorem 14.11 
gives at most Auy/lgn buckets at the top level, which is immaterial. More importantly, 
since m's = ms{l + 0(1/ y/lgn)), the increase in the space is only in the lower-order terms 
by Proposition 14.11 With this additional power, we now support the multiple dictionary 
operations as follows: 

• To find the size of the set Si, we do the following. Find the rank of {i + 1, 0) and the 
rank of {i, 0). The difference gives the size of the set Si. 

• To perform select(2, S'j), find the rank r of (j, 0) and then do select(r + i) in S. The 
second coordinate of the element returned by the select operation is the value of the 
i-th smallest element of Sj. 

• To find rank(x, S'j), find and subtract the rank of (j, 0) from rank((j, x)). Return the 
result if rank((j, x)) > and return —1 otherwise. 

Thus we have: 
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Theorem 6.1 Let Sq, Si, . . . , Sg-i all contained in [m] be a given sequence of s = 0{n) 
sets with Si containing Ui elements, such that J2i=i''^i = Then this collection of sets can 
be represented using B{n,ms) + o(n) + O(lglgm) bits and the rank(x, S'j) and select(j, S'j) 
operations can be supported in constant time for any x G [m],i G [s] and j G {1, . . . ,ni}- 
We can also find Ui for each i in constant time. The first term in the space bound is the 
minimum number of bits required to represent such a sequence of sets. 

We use Theorem 16.11 in the next section to represent k-arj trees. However, Theorem 16.11 
has several direct applications. For instance, it can be used to represent an arbitrary directed 
graph on n nodes, where the vertices are numbered to n — 1 and Si C [n] represents the 
set of neighbors of vertex i. The space used — B{r, rt^) + o(r) bits, where r is the number of 
edges — is information-theoretically optimal, and the representation supports the union of 
the operations supported in 0(1) time by the standard adjacency list and adjacency matrix 
representations, such as adjacency testing, or iteration over the list of neighbors of a given 
vertex. However, it also supports constant-time operations not supported in 0(1) time by 
either of the standard representations, including random access to the i-th neighbor of a 
vertex and reporting the out-degree of a vertex. 



6.2 Representing /c-ary Cardinal Trees 

Recall that a fc-ary cardinal tree is a rooted tree, each node of which has k positions labeled 
0, . . . , /c— 1, which can contain edges to children. As noted in the introduction, the space lower 
bound for representing a fc-ary cardinal tree with n nodes is C{n,k) = Ig ( ^J^i (^"n*"^)) • 
We now give a succinct representation of fc-ary cardinal trees that supports a number of 
operations in 0(1) time. Given a node, we can go to its child labelled j (i.e. the child 
reachable with an edge in the position labelled j), its z-th child or to its parent if these nodes 
exist. In addition, we can determine the degree of a node as well as the ordinal position of a 
node among its siblings in constant time. The representation uses C{n, k) + o{n) + 0(lg Ig k) 
bits of space; the space usage is therefore information-theoretically optimal up to o(n + Ig k) 
terms, and is more space-efficient than the representation of j5]. Unfortunately, we are not 
able to support the subtree size operation in constant time using this representation. Our 
representation imposes a numbering from to n — 1 on the nodes (the representation of [5j 
also imposes a numbering, albeit a different one, on the nodes). 

Theorem 6.2 A k-ary tree on n nodes can be represented using C{n, k) + o{n) + 0(lglg A;) 
bits where given a node of the tree, we can go to its i-th child or to its child labeled j or to 
its parent if they exist, all in constant time. In addition, we can determine the degree of a 
node as well as the ordinal position of a node among its siblings in constant time. 

Proof: Consider a level-ordered left-to-right numbering of the tree nodes by numbers from 
{0, . . . ,n — 1}, starting from the root with 0. From now on, we refer to the nodes of the tree 
by these numbers. By a child labeled j of a node x, we mean the child y of x such that the 
edge {x,y) is labeled j. Let Sx be the set of edge labels out of the vertex x. Then the sets 
Sq, . . . , Sn-i form a sequence of n sets of total size n — 1, each being a subset of [k]. 
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Representing these multiple dictionaries using Theorem 16.11 we get a representation for 
the fc-ary tree using at most B{n — 1, kn) + o{n) + 0{\g\g{kn)) bits. Since (J^J = ■fen+T ('^"^^) ' 
B{n — 1,/cn) + o{n) + 0(lglg(fcn)) = B{n,kn + 1) — lg(A;n + 1) + o{n) + 0(lglgA;n) = 
C{n,k) + o{n) + 0{lg\gk) bits. By Theorem 16.11 we can support the degree of a node x, 
the i-th child of a node x, and the ordinal position (the local rank) of the child labeled j, if 
exists, of a node x, all in constant time. However, the basic navigational operations of going 
to a child or to the parent are not supported. To support these, we re-examine the proof of 
Theorem 16. 1[ Note that in applying Theorem 16. II to represent our tree, the following set S 
is stored in an indexable dictionary: 

S = {{x, j) : X & [n], j & [k] and 3 an edge labeled j out of node x}. 

The representation supports rank((x, j), S') and select((x, j), S") in 0(1) time. It is easy to 
verify that: 

• rank((x, j), S) + 1 gives the label of the child labeled j of node x, if it exists, and returns 

otherwise. 

• The first component of select(2, S) is the parent of the node i. I.e., if the i-th element 
in S is {x,j), then x is the parent of the node i, for i > 0. 

□ 

6.3 Multisets 

Given a multiset M from U = [m], \M\ = n, an indexable multiset representation for M 
must support the following two operations in constant time: 

rankm(x,M) Given x G f/, return —1 if x ^ M and \{y G M\y < x}\ otherwise, and 

selectm(z, M) Given i G {1, . . . , n}, return the largest element x G M such that rankm(x) < 

1 - 1. 



We also consider a generalization of the ran km operation: 

fullrankm(a;) Given x E U, return \{y G M\y < x}\. 

There is an intimate connection between FIDs and multisets similar to that in Lemma 
12. li as shown below. 

Lemma 6.1 Suppose there is an FID representation for any given setT C U using f{\T\, \U\) 
bits of space. Then given a multiset M of n elements from the universe [m], there is a data 
structure to represent M using f{n,m + n) bits of space that supports full ran km and selectm 
operations in constant time. 
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Proof: Consider the m + n bit representation of M obtained as follows. For i = to m — 1, 
represent i by a 1 followed by rii Os where is the number of copies of the element i present 
in the set M. Clearly this representation takes m + n bits since it has m Is and n Os. 

View this bit sequence as a characteristic vector of a set T of m elements from the universe 
[m + n]. Represent T as an FID using f{m, m + n) = f{n,m + n) bits. It is easy to verify 
that fullrankm(x, M) = select(x, T) — x + 1 and selectm(i, M) = select(z, T) — i. (Note that 
[m + n] starts with element 0.) □ 

The following corollary follows from Corollary 14.11 and Lemma 16. 1[ 

Corollary 6.1 Given a multiset M of n elements from the universe [m], there is a data 
structure to represent M and to support fullrankm and selectm operations in constant time 
using B{n, m + n) + o{n) bits of space, provided that m = 0{ny/lgn) . 

We now develop an indexable multiset representation (that supports only ran km and 
selectm operations) taking B{n, m + n) + o{n) + O(lglgm) bits for all n. As was alluded to 
in the introduction (see yUj), the first term is the minimum number of bits required to store 
such a multiset. 

Theorem 6.3 Given a multiset M of n elements from [m], there is an indexable multiset 
representation of M that uses B{n, m + n) + o{n) + O(lglgm) bits. 

Proof: If n is dense in m, i.e. if m = 0{n^/lgn) then the theorem follows from Corollary 

If not, then we represent M as follows. First represent the set S of distinct elements 
present in M using the indexable dictionary data structure of Theorem 14. II using B{n', m) + 
o{n) + O(lglgm) bits where n' < n is the number of distinct elements present in M. 

Then represent the rank information separately by representing each element i present 
in M (in increasing order) by a 1 followed by nj — 1 Os where rzj is the multiplicity of the 
element i in M. This representation is a bitstring of length n with n' I's. This bitstring 
could be considered as a characteristic vector of a set i? C [n] with \R\ = n' . Let R = [n] \R. 

Now to find rankm(a;, M), first find rank(x, S"). If the answer is —1, then return —1. 
Otherwise rankm(a;, M) is select(rank(a;, 5)+!, R). To find selectm(z, M), let r = rank(i, R)+l 
if rank(2, R) > and r = i — rank(i, R) otherwise. The value r is precisely the number of I's 
up to and including i in the characteristic vector of R. Then selectm (i, M) = select(r, 5"). 

To support both rankm(x, M) and selectm(z,M) in constant time in this way, we need 
a fully indexable dictionary for R. If n' is dense in n, i.e. n = 0{n' y/lg n'), then use the 
fully indexable dictionary of Corollary 14. II for R. This uses B{n', n) + o{n') bits for a total of 
B{n',m) + B{n',n) + o{n) + O(lglgm) including the space for representing S. Clearly this 
space is B{n', m) + B{n — n', n) + o{n) + O(lglgm) which is at most B{n, m + n) + o{n) + 
0(lg Igm). 

Otherwise represent R using the FID representation of Lemma 12.21 which uses n + o{n) 
bits. Since n' is sparse in n, n' < en / y/log n (for some constant c) in which case B{n' m)+n < 
B{n,m)+o{n). To see this, note that (™) = > 2(„™^) since {m-n+l)/n > 2 for 

sufficiently large m and n < dm / y/\g m for some constant d. Hence B{n, m) > B{n — 1, m) + l 
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and so B{n, m) > B{n', m) + n — n' . That is, B{n\ m) + n < B{n, m) +n' < B{n, m) + o(n). 
Finally, B{n, m) + o{n) < B{n, m + n) + o{n), by Proposition 14. II as n is also sparse in m. □ 

6.4 Applications of Succinct FIDs 

Here we give more applications of our FID for dense sets obtained in Corollary 14. ll to represent 
sets, multisets and prefix sums. 

6.4.1 Set data structure with select 

Theorem 6.4 There is a representation of a set S C [m] of size n that uses at most 
B{n,m) + o{n) bits and supports the select operation in constant time. 

Proof: The proof is essentially as in the proof of Theorem 14.11 except that we store each of 
the R's sorted list. 

To perform a select(z) operation, we first perform a pred(i) at the top level prefix sum 
representation, to find the bucket Bj in which the i-th element is present. Then a sum(j) 
operation at the top level representation gives the prefix sum of the first j — 1 bucket sizes. 
Now i — sum(j) is the rank of the element in the bucket Bj in which we are interested. Since 
the buckets are sorted, it is easy to find the element of appropriate rank in that bucket. □ 

6.4.2 Multiset with selectm and Prefix Sums 

Theorem 6.5 Given a multiset M of n elements from [m], there is a representation of M 
that uses B{n, m + n) + o{n) bits that supports selectm operation in constant time. 

Proof: Use the encoding given in the proof of Lemma 16.11 to convert the multiset M into a 
set T C [m + n] of size n. Represent T using the representation of Theorem 16.41 which uses 
B{n, m + n) + o{n) bits and supports select operation on T. From the proof of Lemma 16. H 
we know that selectm(i,M) = select(z,T) — i. The theorem follows. □ 
As an immediate corollary we get: 

Corollary 6.2 Given a sequence X = xi, . . . ,Xn of non-negative integers such that J27=i = 
m, the sequence can be represented using B{n, m + n) + o{n) bits to support the partial sum 
query sum(i,X) in constant time. The first term is the information theoretically minimum 
number of bits required to represent such a sequence. 

Proof: Consider the multiset M of partial sum values M = {J2)=i Xj : 1 < i < n}. As the XjS 
are non-negative and add up to m, M C [m]. Represent this multiset using Theorem 16.51 and 
observe that sum(i, X) = selectm(z, M). Also, as the mapping from X to M is invertible, the 
information theoretic minimum number of bits required to store the partial sum information 
is B{n, m + n). The result follows. □ 
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7 Optimality considerations 



As mentioned in the introduction, some of the space bounds we show above may actually 
be very far from the information-theoretically optimal bound of B{n,m). Recall that, for 
example, in the context of storing a set of size n from [m], the information-theoretic lower 
bound of B{n, m) bits may be dwarfed by additive terms of o{n) bits, say when n = m — c 
for some constant c. We now note that this is unavoidable to some extent, and in particular 
that achieving a space bound even polynomial in the information-theoretic lower bound and 
preserving constant query time is impossible for several of the problems that we consider in 
this paper, namely: 

• Supporting rank queries on a set of integers; 

• Supporting select queries on a set of integers and 

• Supporting sum queries on a sequence of non- negative integers (or equivalently, sup- 
porting selectm on a multiset of integers). 

We show that the fullrank problem reduces to all of these. Given a set 5* of size n from 
U = [m], recall that fullrank(x, S) returns the rank of x in for any x & U. Beame and Fich 
[H Corollary 3.10] showed the following: 

Lemma 7.1 Given a set S C [m], \S\ = n, any data structure that uses rf"^^^ words of space 
would require VL[^\g n/ Ig Ign) time in the worst case to answer fullrank queries in the cell 
probe model with word size (Igm)'^*^-'^). 

Lemma 7.2 Given a set S C [m], IS"! = n, one cannot support rank queries on S in 0(1) 
time for all n using (B{n,m))'^^^^ bits in the cell probe model with word size (Igm)*^^"^-* bits. 

Proof: Suppose that the statement of the lemma is false. Then we would solve fullrank queries 
on S in 0(1) time using n^^^'^ words of space as follows, contradicting Lemma [7.11 Letting 
B = B{n,m), by assumption we can store S in B"^^^^ bits, which is (nlgm)*^*^^) bits of space, 
and answer rank queries on S in 0(1) time. Similarly, we can store S in B{m — n^m) = n^^^^ 
words of space and answer rank queries on S in 0(1) time. To answer fullrank(a;, 5) for all 
X G f/, we first compute rank(x, 5*); if the value returned is not —1 we return it as the answer 
to fullrank, otherwise we return x — rank(a:;, 5) — 1 as the answer to fullrank. □ 

Lemma 7.3 Given a set S C [m], l^l = n, one cannot support select queries in 0(1) time 
for all n using (B{n,m))'^^^^ bits in the cell probe model with word size (Igm)*^*^^) bits. 

Proof: Suppose that the statement of the lemma is false. Then given T C [m*], where 
T = {ti, . . . ,tn*} and ti < ^2 < • • • < ^n*, we can answer fullrank queries on T in 0(1) time 
using (n*)*^^^) words as follows, contradicting Lemma [7.11 We create a bit- vector by writing 
down ti Os followed by a 1, then (^2 — ^i) Os followed by a 1, and so on and finally we write 
{m* — tn) Os. This is a bit vector with n* Is and m* Os; we view this is as a characteristic 
vector of a set S of size n = n* from [m] where m = m* +n*. We store S using {B{n, m))*^^^-* 
bits = nP^^^ words and compute fullrank(x, T) as select(x, S*) — x in 0(1) time. □ 
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Lemma 7.4 Given a sequence X = xi, . . . ,Xn of non-negative integers adding up to m, one 
cannot store this sequence in (i3(n,m + n))0(^) Mts of space for all m, n and support the sum 
query on this sequence in 0(1) time in the cell probe model with word size (lg(m + n))'^^^^ 
hits. 

Proof: Suppose that the statement of the lemma is false. Then given S = {si, . . . , s„*} C 
[m*] where Si < S2 < . • • < s„* we could answer full rank queries on S in 0(1) time using 
{n*f^^^ words of space as follows, contradicting Lemma 17. 1[ Create a sequence X which 
consists of Si Os followed by a 1, and for z = 2 to n, Sj — — 1 Os followed by a 1 and 
finally m* — s„* — 1 Os followed by a 1. This is a sequence of n = m* + 1 non- negative 
integers adding to m = n*. We store this sequence using (B{n,m + n))'^^^^ bits, but since 
B{n,m + n) = B{m,m + n) = B{n*, m* + n* + 1) < B{n*, 2m* + 1), the space usage is (n*)*^^^-* 
words. It is easy to verify that fullrank(j, S) = sum(j, X). □ 

8 Conclusions 

We have given a static data structure for storing an n element subset of an m element 
universe, that takes B{n,m) + o(n) + O(lglgm) bits of space and supports rank and select 
operations in constant time (an indexable dictionary) on the RAM model of computation. 
B{n, m) is the information theoretically optimal number of bits needed to store a subset of 
size n from an m-element universe. By modifying our indexable dictionary for the RAM 
model, we obtained an indexable dictionary representation that uses B{n, m) + o{n) bits 
in the cell probe model. This, in particular, implies that n words (of size [Igm] bits) are 
sufficient to represent n elements from an m element universe and answer membership queries 
in 0(1) time on the cell probe model, answering a question raised by Fich and Miltersen [12j 
and Pagh [21]. 

Using the indexable dictionary representation for the RAM model, we have developed 
improved succinct representations for a number of objects. We have shown that a A;-ary tree 
on n nodes can be represented using C(n, k) + o{n) + 0(lglg A;) bits of space and support all 
the navigational operations, except the subtree size of a given node, in constant time. Here 
C(n, k) is the information-theoretically optimum number of bits required to represent a fc-ary 
tree on n nodes. We also developed a succinct representation for an indexable multiset of n 
elements from an m element universe using B{n, m + n) + o{n) + O(lglgm) bits. 

An important subroutine used by the indexable dictionary is a space-efficient fully- 
indexable dictionary (FID) which simultaneously supports rank and select on a set and 
its complement. This data structure, which is functionally equivalent to supporting ranko/i 
and selecto/i on a bit-vector, occupies B{n,m) + o(m) bits and supports all operations in 
0(1) time on the RAM model. We gave further applications of this result, most notably, to 
representing a sequence of non-negative integers in information-theoretically optimal space, 
while supporting prefix sum queries in 0(1) time. 

We have focussed on space utilization and (static) query time, and have not given exten- 
sive consideration either to the time and space required for pre-processing, or to dynamizing 
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the data structure. As regards the pre-processing time, we note that an indexable dictionary 
can be used to sort the input set S, so pre-processing must take at least as much time as 
the best algorithm for sorting integers. Assuming S is presented in sorted order, however, 
the main bottleneck is the creation of hash functions as required by Lemma 12.51 The hash 
functions can be found rapidly using randomization, yielding a linear expected time pre- 
processing algorithm. As regards dynamization, the work of [H] gives a lower bound of 
fi(lg?7,/ Iglgra) time for both rank and select, when 5* is allowed to change by insertions or 
deletions. 

Some open problems that remain are: 

1. Is there a succinct indexable dictionary taking B{n, m) + o{n) bits in the RAM model? 

2. Is there a representation for k-aiy trees taking C{n,k) + o{n) + 0{\g\gk) bits that 
can also support subtree size operation besides the other navigational operations in 
constant time? 

Acknowledgments. We thank the anonymous TALG referee who helped us significantly 
to improve the presentation of the paper. 
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